Nonparametric Estimation of Component Distributions in a Multivariate Mixture by Peter Hall
Abstract
Suppose k-variate data are drawn from a mixture of two distributions, each having independent components. It is desired to estimate the univariate marginal distributions in each of the products, as well as the mixing proportion. This is the setting of two-class, fully parametrized latent models that has been proposed for estimating the distributions of medical test results when disease status is unavailable. The problem is one of inference in a mixture of distributions without training data, and until now it has been tackled only in a fully parametric setting. We investigate the possibility of using nonparametric methods. Of course, when k = 1 the problem is not identifiable from a nonparametric viewpoint. We show that the problem is “almost” identifiable when k = 2; there, the set of all possible representations can be expressed, in terms of any one of those representations, as a two-parameter family. Furthermore, it is proved that when k ≥ 3 the problem is nonparametrically identifiable under particularly mild regularity conditions. In this case we introduce root-n consistent nonparametric estimators of the 2k univariate marginal distributions and the mixing proportion. Finite-sample and asymptotic properties of the estimators are described.
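The model described in the abstract can be written out as a sketch. The symbols below are assumed notation for illustration and are not taken from the source: \pi denotes the mixing proportion and F_{ij} the j-th univariate marginal of component i, which gives the 2k marginals and one proportion mentioned above.

```latex
% Assumed notation (not from the source):
%   \pi     mixing proportion
%   F_{ij}  j-th univariate marginal distribution of component i (i = 1, 2)
F(x_1, \dots, x_k)
  = \pi \prod_{j=1}^{k} F_{1j}(x_j)
  + (1 - \pi) \prod_{j=1}^{k} F_{2j}(x_j),
  \qquad 0 < \pi < 1 .

% Case k = 1: the products reduce to single factors,
F(x) = \pi F_{11}(x) + (1 - \pi) F_{21}(x),
% and the choice F_{11} = F_{21} = F satisfies this for every \pi,
% so the data alone cannot determine \pi.
```

The k = 1 line illustrates the non-identifiability the abstract mentions: with a single coordinate, nothing distinguishes one decomposition of F from another unless further structure is assumed.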
Related resources
Mixtures of location-shifted symmetric distributions
This paper considers a nonparametric approach to fitting mixture distributions that assumes only that the components are symmetric and come from the same location family. Unlike some other nonparametric treatments of mixtures in the literature, our approach assumes univariate rather than multivariate observations. We discuss sufficient conditions for the identifiability of these mixture models,...
Adaptive Estimation in Elliptical Distributions with Extensions to High Dimensions
The goal of this paper is to propose efficient and adaptive regularized estimators for the nonparametric component, mean and covariance matrix in both high and fixed dimensional situations. Although semiparametric estimation of elliptical distributions has also been discussed in [8], we wish to expand the model in two ways. First, study adaptive estimation methods with a novel scheme of estimat...
Determination of the number of components in finite mixture distribution with Skew-t-Normal components
One of the main goals in working with mixture distributions is to determine the number of components. There are different methods for determining the number of components, for example, the Greedy-EM algorithm, which adds a new component to the model until the best number of components is reached. The second method is based on maximum entropy, and finally the third method is based on non...
Statistical Wavelet-based Image Denoising using Scale Mixture of Normal Distributions with Adaptive Parameter Estimation
Removing noise from images is a challenging problem in digital image processing. This paper presents an image denoising method based on a maximum a posteriori (MAP) density function estimator, which is implemented in the wavelet domain because of its energy compaction property. The performance of the MAP estimator depends on the proposed model for noise-free wavelet coefficients. Thus in the wa...
Approximate nonparametric maximum likelihood for mixture models: A convex optimization approach to fitting arbitrary multivariate mixing distributions
Nonparametric maximum likelihood (NPML) for mixture models is a technique for estimating mixing distributions, which has a long and rich history in statistics going back to the 1950s (Kiefer and Wolfowitz, 1956; Robbins, 1950). However, NPML-based methods have been considered to be relatively impractical because of computational and theoretical obstacles. Recent work focusing on approximate NPML...
Publication date: 2003